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Coherent wide parameter-space searches for continuous gravitational waves are typically limited 
in sensitivity by their prohibitive computing cost. Therefore semi-coherent methods (such as Stack- 
Slide) can often achieve a better sensitivity. We develop an analytical method for finding optimal 
StackSlide parameters at fixed computing cost under ideal conditions of gapless data with Gaussian 
stationary noise. This solution separates two regimes: an unbounded regime, where it is always op- 
timal to use all the data, and a bounded regime with a finite optimal observation time. Our analysis 
of the sensitivity scaling reveals that both the fine- and coarse-grid mismatches contribute equally 
to the average StackSlide mismatch, an effect that had been overlooked in previous studies. We 
discuss various practical examples for the application of this optimization framework, illustrating 
the potential gains in sensitivity compared to previous searches. 



I. INTRODUCTION 

Motivation. The detection of continuous gravitational 
waves (CWs) from spinning neutron stars (NSs) in our 
galaxy remains an elusive goal, despite the global net- 
work of detectors LIGO Q], Virgo and GEO 600 [3] 
having completed their initial and enhanced science runs 
(e.g. see [IHZ])- The search for CWs will likely remain 
a difficult challenge with uncertain prospects even in the 
era of Advanced detectors [BHTU] and third-generation de- 
tectors such as ET [11] . Two main reasons for this are (i) 
astrophysical priors on CWs and (ii) the large parameter 
space of signal parameters to explore (cf. [T2] for a review 
and further references). 

(i) Current astrophysical priors contain large uncer- 
tainties on the expected strength of CW emissions from 
spinning NSs, with a strong bias towards extremely weak 
signals, informed by the population of known pulsars 
as well as by a statistical analysis of a putative galac- 
tic "gravitar" population [TJ]. (ii) The required number 
of templates for a coherent matched-filter search over a 
range of unknown signal parameters typically grows very 
rapidly with increasing duration of data analyzed. There- 
fore only a fraction of the available data can be analyzed 
coherently (e.g. see [TlHTtj] ). 

It was realized early on |17j that in situations where the 
total computing cost of the search is constrained, a semi- 
coherent approach could typically achieve better sensi- 
tivity than coherent matched filtering: shorter segments 
of data are analyzed coherently, then the statistics from 
these segments are combined incoherently. One method 
of incoherent combination simply consists of summing 
the statistics from the different segments, which is typi- 
cally referred to as the "StackSlide" method in the CW 
context (also known as the Radon transform). The tem- 
plate bank used for the semi-coherent combination of 
coherent statistics is referred to as the fine grid, as it 
typically requires a higher resolution than the template 
banks of the per-segment coherent searches (referred to 
as coarse grids). Details of the respective template banks 



will be discussed in Sec. MI Dl 

There are a number of different semi-coherent meth- 
ods: for example, recent work [TS] has shown that Stack- 
Slide sensitivity can be improved by a sliding coherence- 
window approach. A closely related variant to StackSlide 
is the Hough transform [19 , which counts the number of 
segments in which the statistic crosses a given thresh- 
old, instead of summing the statistics. This is generally 
less sensitive, but is designed to be more robust in the 
presence of strong non-stationarities. A somewhat differ- 
ent semi-coherent approach are cross-correlation meth- 
ods, described in more detail in [20 . 

Related to the semi-coherent methods are the so- 
called hierarchical schemes, which consist of following 
up "promising" candidates from a (coherent or semi- 
coherent) search by subsequent, more sensitive searches, 
referred to as "stages". This procedure is iterated until 
the parameter space of surviving candidates is sufficiently 
narrowed down for a fully coherent follow-up using all the 
data. Work on implementing such schemes in practice is 
still ongoing. 

Optimization problem. In this paper we focus on the 
standard single-stage StackSlide method, which was also 
used in previous optimization studies (17L I21j , and is rel- 
atively straightforward to describe analytically. 

Any search method contains a number of tuneable pa- 
rameters, such as the template-bank mismatch, the data 
selection procedure, and the number and length of seg- 
ments to analyze. Hierarchical schemes would further 
require specification of the number of stages and the re- 
spective distributions of computing costs and candidate 
thresholds. The sensitivity of a search generally depends 
on all these choices, and we therefore need to study how 
to maximize sensitivity as a function of these parameters. 

This optimization problem has been studied previously 
by Brady and Creighton [T7] (henceforth 'BC') and sub- 
sequently by Cutler, Gholami and Krishnan [3T] (in the 
following 'CGK'). Both studies have focused on the wider 
problem of optimizing a multi-stage hierarchical scheme 
of StackSlide stages, and have directly resorted to fully 
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numerical exploration of the optimization problem. Here 
instead we focus on the simpler single-stage search, which 
allows us to fully analytically analyze the problem. In the 
next step this can be used as a building block to attack 
the optimization of hierarchical schemes. 

Note that for a network of detectors with different 
noise-floors, the choice of detectors to use at fixed com- 
puting cost is part of the optimization problem, but un- 
der the present assumption of "ideal data" the answer 
can be obtained independently |22j . More work is re- 
quired to develop a practical algorithm to compute the 
optimal search parameters for given data from a network 
of detectors, including gaps, non-stationarities and vari- 
ous detector artifacts. 

Summary of main results. Careful analysis of the sensi- 
tivity scaling shows that the average StackSlide mismatch 
is given by the sum of the average mismatches from the 
coarse- and fine-grid template banks, an effect that had 
previously been overlooked. Note that we allow for inde- 
pendent coarse- and fine-grid mismatches, while BC and 
CGK forced them to be equal as an ad-hoc constraint. 

The analytic optimization is achieved by using local 
power-law approximations to the computing-cost and 
sensitivity functions. The results provide analytic self- 
consistency conditions for the optimal solution: if the 
initial power-law coefficients agree with those found at 
the analytic solution, then the solution is self-consistent 
and (locally) optimal. If this is not the case, one can iter- 
ate over successive solutions or scan a range of StackSlide 
parameters, in order to "bootstrap" into a self-consistent 
optimal solution. 

We find that the analytic solution for StackSlide 
searches separates two different regimes depending on the 
power-law coefficients: a bounded regime in which there 
is a finite optimal observation time, and an unbounded 
regime in which the optimal solution always consists of 
using all of the available data, irrespective of the available 
computing-cost. 

Plan of the paper. In Sec.[ll]we discuss the general CW 
optimization problem, which includes the single-stage 
StackSlide search as the lowest-level building block. In 
Sec.|ni]we derive the sensitivity estimate and computing- 
cost functions for StackSlide searches, and motivate their 
approximation as local power-laws. After deriving in 
Sec. |IV| the general analytical solution and discussing a 
few special cases, we provide examples for the practical 
application of this framework in Sec.[V] directed searches, 
all-sky searches, and searches for CWs from NSs in bi- 
nary systems. 



II. MAXIMIZING PROBABILITY OF A CW 
DETECTION 

The goal for wide parameter-space CW searches for 
unknown signals should be to maximize the probability 
of detection, given current astrophysical priors, detector 
data, and finite computing resources. Conceptually one 



can think of this problem as a hierarchy of two questions: 

(i) What parameter-space P C p(°) to search? More 
generally: how to distribute the total available 
computing power Co over the space P' ' of possi- 
ble CW signals, given astrophysical priors, detector 
data and an (optimal) search method? 

(ii) What is the optimal search method? Namely, 
which method yields the highest detection proba- 
bility P(j et on a parameter-space cell AP if we spend 
computing-cost AC on it? 

The answer to the first question relies on the second, 
but we can analyze the lower-level second question inde- 
pendently of the first. There has been surprisingly little 
work on this problem so far. The first question has hardly 
been addressed at all, except for recent work by Knispel 
[2"5] . The second question has been studied previously 
by BC [17] and CGK [21], assuming a specific type of 
hierarchical scheme, which we refer to as the classical 
hierarchical scheme (CHS). 

In the CHS one performs a hierarchy of semi-coherent 
searches (called stages), starting with a relatively low- 
sensitivity search over the whole initial parameter space 
pW. Promising candidates crossing the first-stage 
threshold are selected and constitute the search sub- 
space 

p(2) c p(i) f or the second, higher -sensitivity stage. 
This is iterated until eventually after m such stages a 
fully-coherent search over all the data can be performed 
on the surviving candidates. At this point one has 
reached the maximal possible sensitivity for a small por- 
tion p( m ) c P*- 1 ) of the initial parameter space. 

Each stage (i) is characterized by its input parameter- 
space pM , a computing-cost constraint Cq and a false- 
(i) 

alarm probability p {J ^. Each stage selects a candidate 
subspace P^ +1 ) C P (l) to follow up in the next stage. 
An optimal per-stage search would result in the highest 
detection probability p^ t for given signal strength h rTns 

and constraints {p$, PW, C^}. The tuneable CHS pa- 
rameters are therefore the number m of stages and the 
per-stage constraints {f>£A> C*o }■ These can be varied 
in order to maximize the overall detection probability 
Pdet(^rms) for the given total signal parameter-space F^°\ 
computing cost Co = Y^Li an d false- alarm proba- 
bility PiA = UT=1 PfA- 

This formulation of the CHS suggests that each stage 
(i) could be considered an independent optimization 

problem for given external constraints {Pf}, P^\ Cg^}, if 
none of its internal parameters interfere with the over- 
all hierarchical scheme. One might contend that the 
parameter-space resolution of the search violates this 
clean factorization: the follow-up space p( I+1 ) from stage 
(i) depends on its parameter-space resolution, which 
might impact the required computing-cost of the next 
stage. However, it is easy to see that (to first order) such 
a coupling is not expected. The number of candidates 
A^cand returned from any stage (except for the last one) 
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will be dominated by the number A/fA of false alarms. 
Therefore Af ca nd ~ AfA ~ PfA-^f, where J\f is the num- 
ber of (approximately) independent templates searched 
in this stage. This can be estimated as AT ~ Vp/vo, in 
terms of the (metric) volume Vp of the parameter space 
P, and the volume vo covered by one template. Therefore 
the number A/" ca nd of follow-up candidates from any stage 
does indeed depend on its parameter-space resolution, 
which depends on the internal stage parameters. How- 
ever, the computing cost of the next stage depends pri- 
marily on the volume of the follow-up parameter-space, 
which is VfA ~ A/fA vq ~ Pia Vp, and is therefore inde- 
pendent of internal stage parameters. It is interesting to 
note that each stage (i) in this scheme achieves a reduc- 
tion of the input parameter-space volume by roughly a 

(i) 

factor of the false-alarm probability pj A , irrespective of 
the internal details of that search. 

The optimal per-stage search method is essentially un- 
known, but following BC and CGK we focus on a known 
good strategy, namely the StackSlide method. While dif- 
ferent semi-coherent methods differ in the details and 
their exact sensitivity, they share the main characteris- 
tics of coherently searching N shorter segments of length 
AT, and combining them incoherently in some way. We 
roughly expect the sensitivity per cost of different meth- 
ods to behave qualitatively similarly to the StackSlide 
method, but more work would be required to study this 
in detail. 



III. PROPERTIES OF A SINGLE-STAGE 
STACKSLIDE SEARCH 



The general StackSlide scheme consists of dividing the 
data (of total duration T) into N segments of duration 
AT = T/N, then performing a coherent matched-filter 
search on each segment and combining these statistics in- 
coherently to a new statistic E by summing them across 
segments. The coherent matched-filter statistic used is 
the ^-statistic, which was first derived in [23] and ex- 
tended to multiple detectors in [25]. Using the same 
amount of data as a fully coherent search, the resulting 
semi-coherent statistic is less sensitive, but substantially 
cheaper to compute over a wide parameter space. At 
fixed computing cost a semi-coherent search is therefore 
generally more sensitive than a fully coherent ^-statistic 
search. 

Notation: we distinguish quantities Q that can refer to 
either the coherent or the incoherent step in the following 
way: we use a tilde, i.e. Q when referring to the coherent 
step, and a hat, i.e. Q when referring to the incoherent 
step. For the following derivations we restrict ourselves 
to a single-detector formalism for simplicity, but we state 
the (trivial) generalization to N& ct > 1 detectors of rele- 
vant results. 



A. The StackSlide search method 

Let k — 1 ... N be the index over segments, and A £ P 
a point in the search space P of signal parameters. The 
"ideal" StackSlide statistic Eq(A) is defined as 



E (A) = ^2J- fc (A), 



(1) 



k=l 



i.e. a simple sum of ^-statistic values {2J r / c (A)}^ =1 com- 
puted at the same template point A across all N seg- 
ments. 

This would require computing the .F-statistic over the 
same template bank as E in every segment. However, 
the metric resolution of Eo is generally finer than that 
of the single-segment ^-statistics [26^ , and therefore this 
approach would spend unnecessary computing cost on 
the coherent F-statistic. In practice T is therefore com- 
puted over a coarse grid of M templates in each segment 
k, and is interpolated in order to sum T on the fine grid 
of Kf > Kf templates (e.g. see [27]). 

Typically the interpolation consists of picking the clos- 
est (in the metric sense) coarse-grid point A&(A) to the 
fine-grid point A from every segment k, i.e. we approxi- 
mate Eq. by 



JV 



E(A) = ^2F fe (A fe (A) )«E„(A), 



(2) 



k=l 



which we refer to as the "interpolating" StackSlide statis- 
tic E. The following sensitivity optimization focuses ex- 
clusively on this interpolating StackSlide method, which 
is the most relevant approach for current practical ap- 
plications. The subtle difference between interpolating 
StackSlide E and ideal StackSlide E with respect to its 
sensitivity and mismatches has been overlooked in previ- 
ous studies, and will be important for the optimization 
problem. 



B. Mismatch and metric 

1. J- -statistic mismatch 

In the presence of a signal timeseries s(i, A s ) with phase 
parameters A s , the statistic 2J 7 / C (A) in a point A follows a 
non-central ^-distribution with four degrees of freedom 
and non-centrality parameter pf,(X s ,X). We denote this 
probability distribution as 



P(2T k \pt)=xl(2T k ;p 2 k ). 
which has the expectation value 



E 



2F fc (A s ,A) =4 + p2(A s ,A) 



(3) 



(4) 



The quantity p k is often referred to as the coherent signal- 
to- noise ratio (SNR). In the case of a perfectly-matched 
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template A = A S! the resulting "optimal" SNR [23] in 
segment k can be expressed as 

rtfc+AT 

pt(X s ,X s ) = -^ I s 2 (t,X s )dt 



2 

S n 
2 



h 



AT 

rmsl • 



(5) 



where tk is the start-time of the fcth segment, S n is the 
(single-sided) noise power spectral density at the signal 
frequency / s . In the second equality we defined the rms 
signal strength /i rms ,fc [21] m segment fc, which is a use- 
ful measure of the intrinsic signal strength in the detec- 
tors, independently of the quality and the amount of data 
used. 

The signal strength /i rm s,/c depends on the intrinsic 
signal amplitude ho, the sky-position, polarization an- 
gles, and detector orientation during segment k. One 
can show [M] |28] that averaging h 2 ^ k isotropically over 
sky-positions and polarization angles yields the relation 
(^ms,fc)sky, P oi = (2/25) h 2 ,. Furthermore, for segment 
lengths of order AT > 0(1 days), the averaging in 
Eq. ([5]) results in /i rms ,fc tending towards a constant over 
all segments. Therefore it will be convenient to approxi- 



mate h r 



Pk( X s,K) 



h lnls , and so we can write 
2 . 



g rms 



AT = ^ pt (AT). 



(6) 



which defines the average optimal SNR p opt for given 
segment length AT. 

Note that this approximation only applies to the 
perfectly-matched SNR pk{X Sl \ s ). The "mismatched" 
SNR /9fc(A s , A) in an offset template A = A s + AA is re- 
duced with respect to the optimal SNR p opt (AT). The 
corresponding relative loss defines the (segment-specific) 
mismatch function Jlk(X s ,X), namely 



p k (X Sl X) = 1 - 



^(A S ,A) 
^pt(AT) 



^■ fe (A s )AA l AA-'+0(AA 3 



(7) 



where Taylor-expansion for small offsets AA defines the 
(coherent) metric tensor <7ij,fc(A) for segment k. The con- 
cept of the parameter-space metric was first introduced 
in |291 130] . and analyzed in the context of a simplified 
CW statistic [14] and the .F-statistic [3Tj . 

The per-segment coarse-grid template bank is con- 
structed under the constraint that no signal point A s G P 
should exceed a given maximal mismatch fh to its clos- 
est (i.e. with the smallest mismatch) coarse-grid template 
Afe(A s ), namely 



Pk 



(x s , Afc(A s 



< m for all A s G 



(8) 



2. Mismatch of "ideal" StackSlide 



The "ideal" StackSlide statistic S defined in Eq. 
is the basis for the definition of the semi-coherent metric 



OH [2S E2]). The statistic E follows a non-central x 2 - 
distribution with 47V degrees of freedom, denoted as 



p(£o|pU =xIn(Zo;p1 )-. 

with non-centrality parameter 

N 



p| o (A s ,A)^5> 2 (A s ,A) 



(9) 



(10) 



k=l 



where A s are the signal parameters and A is the location 
of a fine-grid template. The corresponding expectation 
value is 



E 



So(A s ,A) 



= 47V- 



(ii) 



The perfectly-matched non-centrality parameter 
p\ (A s , A s ) can be expressed as 



Pl ( X s,K) = ^Pfc(A s , A s ) 



k=l 



N P 2 opt (AT) 

Popt( T ) , 



(12) 



which is identical to that of a perfectly-matched J-- 
statistic over the same total duration T, as seen from 
Eq. ^. The reason why the StackSlide statistic E is 
less sensitive than the /"-statistic for the same amount 
of data T stems from the different degrees of freedom 
of the respective distributions, namely xKp 2 ) f° r the T- 
statistic as opposed to x1n(p 2 ) f° r StackSlide S . 

The mismatch function jUo(A s , A) of ideal StackSlide is 
defined in analogy to Eq. ^ as 



p (X s , A) = 1 



P| (A S ,A) 
P 2 opi (T) 



g ij (X B )AX i AX j +0(AX 3 



(13) 



where A A = A — A s is the offset between the fine-grid tem- 
plate A and the signal location A s , and Taylor-expansion 
in small AA defines the (semi-coherent) metric tensor g^. 
Using Eqs. ( 12 ) and d7|), we can rearrange the expression 
for the mismatch as 



p. (X s , A) = — ^ju fe (A s , A) 

k=l 

ra f^E&,*(*.)J AA 4 AA^", (14) 

which shows that the ideal StackSlide mismatch p and 
metric gij are segment-averages of the coherent mis- 
matches and metrics, respectively. 

The fine-grid template bank of a StackSlide search is 
constructed under the constraint that no signal point 
A s G P should exceed a given maximal mismatch fh to its 
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closest (i.e. with the smallest mismatch) fine-grid tem- 
plate A(A S ), namely 



(15) 



JIq A s , A(A S ) < m for all A s e 



3. Mismatch of "interpolating" StackSlide 



AA 



We can now combine the above results to derive the 
mismatch of the interpolating StackSlide statistic £ de- 
fined in Eq. ^ . This statistic follows a non-central \\n 
distribution, namely 



with non-centrality parameter 

N 

p|(A s ,A)^5>2 ^x k (\) 

k=l 



(16) 



(17) 



where A s are the signal phase parameters, and Afc(A) is 
the closest coarse-grid template in segment k to the fine- 
grid point A. 

The mismatch function /I(A S , A) of interpolating Stack- 
Slide is therefore 



£(A S ,A) = 1 



P|(A S ,A) 
pl P t(T) 



1 N 



(18) 



k=l 



which allows us to express the mismatched non-centrality 
parameter as 



(19) 



pi(X s ,X) = 1-m(A s ,A) pU{T) 



The extra offset per-segment, 8X k = X k (X) — A, incurred 
due to using the closest coarse-grid point Afc(A) instead of 
the fine-grid point A tends to increase the mismatch with 
respect to the ideal mismatch function /Iq of Eq. ( 14 ). In 



order to quantify this effect, we write the effective per- 
segment offset from a signal as AAfc = A^(A) — A s , while 
the ideal per-segment offset would be A A = A — A s . We 
can write AA^ = A A + SXk, and inserting this into the 
coherent-metric of Eq. ^ we obtain (neglecting higher- 
order terms O (AA 3 )): 



^ fe (A s , A fc (A)) = g ijik AX l k AX{ 

= /I fc (A s , A) + !hlJ . 5X1 $K + 2?ii,fe A A 1 5X{ , (20) 
where in the first term we recover the ideal per-segment 



mismatch function of Eq. (14 1, the second term repre- 
sents an extra mismatch due to the offset SXk-, while 
the last term depends on the opening angle Q k of the 



A fc (A) 



SX k 



FIG. 1: Mismatch triangle formed by the signal point A s , clos- 
est fine-grid template A, and the coarse-grid template Afc(A) 
closest to A in segment k. 

mismatch triangle (see Fig.jl]), namely 2|AA||<5A/ C | cosOk, 
with mismatch norm defined aS |x| = Qij,k X X"^ . 

We assume that the fine-grid point A falls randomly 
into the Wigner-Seitz cell of the closest coarse-grid tem- 
plate Afe(A) in segment k. Given that the coarse-grid 
metric Qij t k generally varies across segments, we further 
assume that the offset 5X k approximates a uniform ran- 
dom sampling of the coarse-grid Wigner-Seitz cell. In- 
serting Eq. ( 20 ) into ( 18 ) , we see that the average over the 



angle-term |AA||(5Afc| cos 6 k will tend to zero, as any sign 
of cos 8k is equally likely, while the average norm |<5Afc| 2 
will tend to the average mismatch (Jl) of the coarse-grid 
template bank, and so we obtain 



m(A s , A) « /i (A s ,A) + (ju) 



(21) 



When estimating the sensitivity of the interpolating 
StackSlide statistic, we will further average this expres- 
sion over randomly-chosen signal locations A s , and there- 
fore the above approximate averaging expressions will be- 
come exact, and we obtain 



(fl) = (p, ) + (ju) , 



(22) 



where averaging is performed over the coarse- and fine- 
grid template banks (i.e. the respective Wigner-Seitz 
cells) . 

The probability distribution of signal mismatches in a 
given template bank constructed with a certain maximal 
mismatch m depends on the structure and dimension- 
ality of the template bank. The corresponding average 
mismatch can be expressed as (fi) = £ m, where £ € (0, 1) 
is a characteristic geometric factor of the template bank. 
Such mismatch distributions were studied quantitatively, 
for example in |33j . For hyper-cubic lattices, the geomet- 
ric relation is well known to be exactly (/i) = m/3, which 
was used in previous optimization studies [T7l [21] . For 
the more efficient A* n -lattices this geometric factor is ap- 
proximately £ « 0.5 — 0.6 for low dimensions n < 6. Here 
we allow for general geometric factors £, but for simplicity 



() 



we assume it to be identical for the fine- and coarse-grid 



template banks, and so Eq. (22 1 can be written as 



CP) = £(m + in) , with £, e (0, 1) 



(23) 



where fh and m are the maximal mismatch parameters 
of fine- and coarse-grid template banks, respectively. 



Averaging the non-centrality parameter p| of Eq. ( 19 ) 
over random signal parameters A s at fixed signal strength 
/i rms , we can now obtain the expression 



(p!) = [1 - £(m + m)] 



2iVdc 
5 n 



h T 



(24) 



where we (trivially) generalized the result to the case of 
a network of iVdet detectors. In this case S n refers to the 
harmonic mean over individual-detector PSDs, and h lms 
is a noise-weighted average over rms-amplitudes from dif- 
ferent detectors (e.g. see [28]). The fact that both the 
coarse- and fine-grid mismatches enter this expression has 
been overlooked in previous studies [TTJ [21] , where only 
the fine-grid mismatch fh had been included 1 . 



C. Sensitivity estimate 

The false-alarm and false-dismissal probabilities for a 
given threshold S t h of the StackSlide statistic E of Eq. ([2| 
are 

/>oo 

Pf A (£th)= / xLv(£;0)d£, (25) 

xljv( s ;p|) ds . ( 26 ) 

-oo 

where the special case of a coherent ^-statistic search 
corresponds to N = 1. 

Sensitivity is often quantified in terms of the weakest 
(rms-) signal strength hth required to obtain a given de- 
tection probability p*j ct = 1 — p*^ at a given false-alarm 
probability p* fA . This requires inverting Eq. (25) to ob- 
tain the critical threshold 



£th(p 



[A J 



then substi- 



tuting this into Eq. ( 26 ) and inverting to find the critical 



non-centrality parameter 



pf = pl(PtA,P*{D,N). 



(27) 



The signal location A s is generally unknown, there- 
fore the mismatch p(X s , A) of the closest template A and 
the correspondin g mi smatched non-centrality parameter 
p|;(A s ,A) of Eq. (19 1 follow a random distribution. In 
order to estimate the threshold rms signal strength hth, 
one would have to compute P[d(Pia, hth) by averaging 



the right-hand side of Eq. ( 26 ) over the (known) mis- 
match distribution of p. Furthermore, for statements 



about physical upper limits and sensitivity of a given 
search pipeline, it is often required to quantify the sensi- 
tivity in terms of the intrinsic GW amplitude ho, instead 
of the rms detector strain h rms , which would require fur- 



ther averaging of Eq. ( 26 ) over the (potentially) unknown 
sky-position and polarization parameters. This problem 
has recently been studied in detail in [53] . 

For our present purpose it will be sufficient to obtain 
the correct scaling of sensitivity with StackSlide param- 
eters {N,T,m,m}, while the absolute sensitivity level is 
less important. We will therefore employ the usual sim- 
plification of this problem, which consists in averaging 
p\ instead of P[d(Py,) over the mismatch distribution of 
p, so we approximate 



(28) 



The results of [34] indicate that this indeed approxi- 
mately preserves the scaling of sensitivity as a function 
of StackSlide parameters. 



Pro (Pf a ,hh) = \ Pro (Pf a ,Ps)\ hth 
-Pro (PfA,(p|> ; 



We can now use Eq. (24 1 to translate the critical non- 
centrality parameter of Eq. (27 1 into a threshold rms 
signal-strength hth, namely 



, _ 2 2N dct T 
hh = —P2~ I 1 - ?( m + m )\ ~q~ 



(29) 



Following the Neyman-Pearson criterion we want to max- 



imize detection probability p^ c 



1 



Pro 



at fixed 



false-alarm probability Pf A and at fixed signal strength 
fc lms . Equivalently 2 we can fix the false-alarm and false- 
dismissal probabilities and minimize the required thresh- 
old rms signal strength /i t h, which is the traditional op- 
timization approach used in previous studies |171 121| . 



1. Gauss approximation for large N 

One approach (used in [19l [21] ) to make further ana- 
lytical progress consists in assuming a large number of 
segments, i.e. N 3> 1, and invoke the central limit theo- 
rem to approximate \\n by a Gaussian distribution 



-1/2 



exp 



(S-S) 

24 



(30) 



with mean and variance of xtivCPs) given by 

£ = 4N + p\ , 
ol = 2(47V + 2pl) . 



(31) 



1 These studies additionally imposed the ad-hoc constraint of 
fh = fh in the computing-cost expressions 



Due the monotonicity of pfrj as a function of h r 
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This allows us to analytically integrate Eqs. (25), (26), 
which yields 



PfA(S 



ih 



= l eric 

PfD(Sth,Ps) = o erfc 



2^4/V 



Pi 



(E th ~ 47V) 



2^4^ + 2^ 



(32) 
(33) 



where erfc(x) = 1 — erf (a;) is the complementary error- 
function. Substituting Eq. ( 32 1 into Eq. ( 33 ) , we obtain 



Pi 



2ay/AN 



2v/4iV + 2p| ' 

where we defined 

^^erfc" 1 (2^) = -erfc^ 1 (2^ ct ) : 
a = erfc _1 (2pf A ) . 



(34) 



(35) 



Solving Eq. ( 34 ) for the critical non-centrality parameter 
p^?, we obtaim 



p*i (a, P, N) = 2aV4N + Af3 2 



2/3V4A r + 4aV4iV 



(36) 



which we refer to as the "Gauss approximation" . In- 
troducing the average per-segment SNR pjr as p 2 -p = 
(p^j/N, one can consider two interesting limits of the 
false-dismissal equation (34): 



(i) strong-signal limit (pjr 3> 1): the per-segment SNR 
of the signal is large, and we obtain 



V8/3, 



(37) 



which is somewhat pathological, as /3 > 1 and 
therefore the detection probability is extremely 
close to pdet = 1- Neither false-alarm threshold nor 
the number of segments N matter for detectability 4 
in this case. 

(ii) weak-signal limit {p 2 T <C 1): the per-segment SNR 
of the signal is small, and using TV ^ 1 we find 



f AN(a + 13), 



which we refer to as the "weak-signal Gauss ap- 
proximation" (WSG), which was first used in [T5] 
to estimate the sensitivity of the Hough method. 
This approach results in the "classic" semi-coherent 
sensitivity scaling as a function of N , namely 



'th ,WSG 



iVdct [J_ 

2S n 



£(m + m)] T 



(39) 



3 The second solution has j3 < 0, corresponding to pfrj > 0.5. 

4 This has been noted previously for radio observations [35] 



In practice we find that the WSG approximation is 
often not well satisfied, and the deviations of the N- 



scaling in Eq. ( 38 ) from the exact form of Eq. ( 27 ) can 



lead to dramatically different optimal solutions. Already 
the Gauss approximation of Eq. p6| is not well satisfied 
for small false-alarm probabilities pfA <C 1 and segment 
numbers in the range N < O (1000), as can be seen in 
Fig. [2] A more reliable approximation was recently in- 
troduced in |34j , namely using the Gaussian distribution 
only for the false-dismissal equation ( 26 1 , while keeping 
the central ^-distribution for the false-alarm equation 
(25). For the present work this approach would not be 
well-suited, however, as we need the sensitivity equa- 
tion in the form of a power-law in T and N, similarly 



to Eq. (39). 



2. Local power-law approximation for 

We can incorporate the exact iV-scaling of the criti- 
cal non-centrality parameter p^? of Eq. (27) by locally 
expressing it as a power-law in the form 



(40) 



where wfpf^p^), No) is a parameter quantifying the rela- 
tive deviation of the exact A^-scaling from the WSG limit 
of Eq. p8| , where w = 1. The power-law coefficients can 
be computed as 



w= 2 



dlosN 



*2 AT -l/(2w) 

r = p s iV 



(41) 



evaluated at a point {p^^p^, Nq}. 

The function w(N) is shown in Fig. [2j for a refer- 
ence false-dismissal probability of p^ = 0.1 and different 
choices of false- alarm probability Pf A , both for the ex- 
act solution Eq. ( 27 ) and for the Gauss approximation of 
Eq. ( 36 1 . We see that the exact iV-scaling w increasingly 
deviates from the WSG approximation (w — 1) at lower 
false-alarm probabilities and at smaller N. The Gauss 
approximation tends to agree better with the exact scal- 
ing at larger iV (as expected), and at higher false-alarm 
probabilities. 

Using the power-law approximation of Eq. ( 40 1 , we can 



(38) now express the threshold signal strength of Eq. ( 29 ) as 



Lq(N, T, m, fh) 



rp S n _ 2 
2N dct ftth 



[1 - ^(m + fh)] TN- 1 '^ , (42) 



which defines the objective function Lq that we want to 
maximize as a function of the StackSlide parameters. 

We see that, without further constraints the optimal 
solution would simply be m — > 0, N — > 1 and T — > 
Tmax, i-e. a fully coherent search over all the available 
data T max with an infinitely fine template bank. This 
would obviously require infinite computing power, and 
we therefore need to extend the optimization problem by 
a computing-cost constraint. 



3.5 
3 

2.5 
2 

1.5 
1 
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PfA = 10-2 * 

PfA = 10" 5 n >< _ 
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FIG. 2: ^-scaling coefficient w defined in Eq. (401 as a 
function of N, for faise-dismissai probability Pf D = 0.1, and 
different false-alarm probabilities Pf A € [10 -10 , 10 -5 , 10~ 2 ]. 
Solid lines show the scaling obtained from the exact solution 
Eq. (27 1, while dashed lines refer to the Gauss approximation 
of Eq. (361. The WSG approximation corresponds to w = 1. 



D. Template counting 

For both the coarse 5 and the fine grid, the respective 
number of templates 6 {Af, Af} covering the parameter 
space P is given (3*2 ES] by the general expression 



Af = n m-"/ 2 V„ , with V„= / y/detgcTX, (43) 



where to is the maximal-mismatch parameter, det g is 
the determinant of the corresponding parameter-space 
metric gij, and V n denotes the metric volume of the n- 
dimensional space T„ C P spanned by the template-bank. 
The normalized thickness n depends on the geometric 
structure of the covering, for example — ?W 2 2~™ for 
a hyper-cubic lattice Z n . 



An important subtlety in Eq. ( 43 1 is the dimensional- 
ity n of the template-bank space T„, which can be smaller 
than the dimensionality of the parameter space P, as pre- 
viously discussed in jTTJ [5T] . The template-bank dimen- 
sionality n is generally a (piece- wise constant) function of 
the StackSlide parameters {N, AT, to}, which determine 
the metric resolution. The extent of P along certain di- 
rections can be "thin" compared to the metric resolution 
and would require only a single template along this direc- 
tion, effectively not contributing to the template-bank di- 
mensionality. For different StackSlide parameters, how- 
ever, the resolution might be sufficient to require more 



5 We assume a roughly constant number of coarse-grid templates 

across all segments. 

6 The templates in this formulation are not to be confused with 
the "patches" used in BC Q7] and CGK [21]. A "patch" in the 
BC/CGK framework corresponds to a line of templates along 
the frequency axis. 



than one template along this direction, adding to the 
template-bank dimensionality n. 

Following [21 [T71 [5T], the correct dimensionality for 
given StackSlide parameters can be determined by the 
condition that n should maximize the number of tem- 



plates Af n computed via Eq. (43), i.e. 

Afn = max AC, , and Mtt — max AC, 



(44) 



This can be understood as follows: if Af decreases when 
adding a template-bank dimension, then the correspond- 
ing parameter-space extent is thinner than the metric 
resolution and therefore adds "fractional" templates. On 
the other hand, if A^ decreases by removing a dimension, 
then its extent is thicker than the metric resolution and 
requires more than one template to cover it. 

An interesting alternative formulation can be obtained 
by expressing Eq. (44 1 as the condition N n /N n -i > 1 
for including an additional dimension n. For constant 
metrics and simple parameter-space shapes, i.e. V n = 
J y/gd n X = Jg AAi x AA2 . . . x AA n , this can be shown 
to be equivalent to 



n AA„ 
9 n _i d\ n 



> 1 



(45) 



where d\ n = ^/mg nn is the metric template extent along 
dimension n, 



in terms of the diagonal eleme nt g 



of 



the inverse metric g tJ . This shows that Eq. (44) boils 
down to (apart from the lattice-thickness ratio)the re- 
quirement that the parameter-space extent AA„ along a 
given dimension n must exceed the corresponding metric 
template resolution d\ n . 

The coherent (coarse-grid) metric volume Vn is typi- 
cally a steep function of the coherence time AT, and can 
often be well approximated (over some range of AT) by 
a power law, namely Vs(AT) oc AT 9 . We can therefore 
write Eq. ( 43 1 for Af in the power-law form 



fifa (to, AT) = km~ n/2 AT q . 



(46) 



where k = 6>sVs(AT ) AT 9 for some choice of segment 
length AT . 

The semi-coherent (fine-grid) metric volume Vn gen- 
erally depends on both AT and N and can typically 
Q21 [23 [32] be factored in the form 



Vn(N,AT) = ln (N)V n (AT) 



(47) 



in terms of the refinement factor 73 (N) > 1 and the 
coherent-metric volume V^(AT) of the fine-grid template 
space. Typically j(N) can be well approximated (over 
some range of N) by a power law, namely 7(A) oc N p . 
We can therefore write Eq. ( 43 ) for A^ in the power-law 
form 



A4 (to, AT, N) = km- H/2 AT 9 N p . 



(48) 



where k 
parameters {ATq, Nq} 



0nVn(AT o ,N o )AT- q N o ' p for some choice of 
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E. Computing-cost model 



where 



The total computing cost C\ t of the interpolating 
StackSlide statistic has two main contributions, namely 



C tot (m,m,A,AT) = C + C, 



(49) 



where C(m, N, AT) is the computing cost of the T- 
statistic over the coarse grid of Af„ templates for each 
of the N segments, and C(rh, N, AT) is the cost of in- 
coherently summing these /-values across all segments 
on a fine grid of A/jj templates. Note that we neglect all 
other costs such as data-IO etc, which for any compu- 
tationally limited search will typically be much smaller 
than Ciot. 



1. Computing cost C of the coherent step 

The computing cost of the coherent step is 

C{fh, N, AT) = NJ\fn(m, AT) N dct ci(AT) , (50) 

where c\ (AT) is the /"-statistic computing cost of a single 
template for a single segment and a single detector. Here 
we used the fact that to first order [31] the number of 
detectors has no effect on the number of templates Af '. 

As discussed previously in |21j . there are two funda- 
mentally different implementations of the /-statistic cal- 
culation currently in use: a direct SFT-method [37] . and 
a (generally far more efficient) FFT-method based on 
barycentric resampling [531 EE] ■ 

(i) The SFT-method consists in interpolating fre- 
quency bins of short Fourier transforms ("SFTs") 
of length TgpT, using approximations described in 
[25] [37]. The resulting per-template cost ci(AT) is 
directly proportional to the segment length AT: 



-SFT 

c i 



(AT) 



T^FT 



AT 

Tc 



SFT 



(51) 



where Cq FT is an implementation- and hardware- 
dependent fundamental computing cost. 

(ii) In the FFT-method the cost of searching a fre- 
quency band A/ using an (up-sampled by u) FFT 
frequency-resolution of u/AT is proportional to 
Aff log 2Aff , where Aff — uAf AT is the number 
of frequency bins. We can therefore express the 
per-template /"-statistic cost ci(AT) as 



~FFT 



(AT) 



~FFT 



log(2uA/AT) . 



(52) 



where Cq FT is an implementation- and hardware- 
dependent fundamental computing cost. 



Using the power-law model of Eq. ( 46 ) for Af, we can 



write the coherent computing cost in the form 



C(fh, N, AT) 



-n/2 j^ti fcrpS 



(53) 



r/ = l, 5 = q + AS, 



and where AS is either 



AS 



AS 



SFT 



A<5 FFT = pog(2«A/AT )]" 



(54) 



(55) 



depending on whether the /-statistic is computed using 
the SFT- or FFT-method, respectively. The expression 
for A<5fft can be obtained via Eq. ( 62 1 and depends (al- 
beit weakly) on the reference segment length ATq. The 
corresponding proportionality factors H are found as 



TlSFT 



K SFT = 9n N, 



dct 



V K (ATo) 



Tsft AT 9 

-^FFT 



KpFT 



N de 



AS 



FFT 



Vn(ATp) 

at] 



(56) 



2. Computing cost C of the incoherent step 

The computing cost of the StackSlide step is 

C(m,N,AT) = NAfn(m,AT,N)co, (57) 

where cq is the implementation- and hardware-dependent 
fundamental cost of adding one value of 2/^ for one fine- 
grid point A in Eq. (J2J) , including the cost of mapping the 
fine-grid point to its closest coarse-grid template A&(A). 
The incoherent step operates on coherent multi-detector 
/-statistic values, and therefore does not depend on the 
number of detectors A^et- 



Using the power-law model of Eq. ( 48 ) for Af, we can 

(58) 



write the incoherent computing cost as 

C(m, N, AT) = k m- H/2 A™ AT 
where 

Tj = p + 1 , S = q, 
and the proportionality factor 

Vs(A ,AT ) 



CO 



N*AT« 



(59) 



(60) 



for given reference values {No, ATq}. 



3. General power-law computing-cost model 



Combining Eqs. (|53[) and ( 58 ) we arrive at the follow- 



ing power-law model for the total computing cost, defined 
in Eq. ( 49 1 , namely 



Ctot = « m~ n/2 N v AT 5 + k m- Tl ' 2 A' 7 AT* . (61) 
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If a given computing-cost function does not follow this 
model, we can always produce a local fit to Eq. (61 1, 



which should be valid over some range of parameters 
{AT, N}, namely 



6 = 



dlogC 
d log AT 



dlogC 
dlogN 



= C(m ,jV ,ATo) 



(62) 



(63) 



for reference values {m , N , AT }. Note that S gen- 
erally depends only on ATo, while r\ depends only on 
No, due to the w ay these dependencies typically fac- 
tor (cf. Sec. HIE). The mismatch dependency oc to - ™/ 2 



is exact according to Eq. ( 43 ) , but a given computing- 



cost function might still deviate from this behaviour 
(e.g. the BC/CGK computing-cost function discussed in 
Sec. VC). In this case one can extend the power-law fit 
by extracting the "mismatch-dimension" n via 



n = -2 



dlogC 
<9 log TO 



(64) 



It will be more convenient in the following to work in 
terms of {N,T} instead of {N, AT}, where T = N AT 
is the total time span of data used. Changing variables, 
we obtain the computing-cost model in the form 

C tot = Km-" /2 N-zT I +Km- n/2 N- t T S , (65) 
where we defined 



V, 



(66) 



generally satisfying e > for all realistic cases considered 
here. Note that to and N are dimensionless, therefore the 
respective units of [C/k] are [T s ]. 



IV. MAXIMIZING SENSITIVITY AT FIXED 
COMPUTING COST 

We want to maximize the objective function Tq oc 
defined in Eq. (42) under the constraint of fixed com- 
puting cost, Ctot = Co. We therefore need to find the 
stationary points of the Lagrange function 



L(N, T,m,m,m)=L + m[C + C - C ] 



(67) 



where stationarity with respect to the Lagrange multi- 
plier, i.e. d^L = 0, returns the computing-cost constraint 

c + c = c . 

Table [T] provides a "dictionary" summarizing the nota- 
tion used here and in the previous section to formulate 
the optimization problem. 

Before embarking on the full optimization problem, it 
is instructive to consider two special cases, namely (i) a 
fully coherent search, and (ii) searches where the comput- 
ing cost is dominated by one contribution, either coherent 
C or incoherent C. 



A. Special case (i): Fully coherent search 



The fully coherent search is a special case of Eq. ( 67 ) 
with the additional constraint N = 1, and therefore 
AT = T, m = 0, and C = 0. This leaves us with the 
reduced Lagrangian 

L(T, m, w) = (1 - £m) T + w [k to~ s/2 T 5 ~ C ] . (68) 

Requiring stationarity with respect to {T, to, m} results 
in the optimal StackSlide parameters 



(69) 
(70) 




~ I m opt 



Interestingly the optimal mismatch TO op t is independent 
of both the computing-cost constraint C and the ob- 
servation time T. The scaling of the resulting threshold 
signal strength h t h with computing cost Co is therefore 



oc C, 



1/(25) 



(71) 



In practical applications we often find 5 w 3 — 7, and 
so T op t and will increase very slowly with increas- 
ing computing cost Co . This indicates that a brute- force 
approach of throwing more computing power at a fully 
coherent search will typically yield meagre returns in sen- 
sitivity. 



Special case (ii): Computing cost dominated by 
one contribution 



If either the coherent C or incoherent C contribution 



dominates the total computing cost (65), we can write 



C t , 



-n/2 J\J~£ 



(72) 



where all StackSlide parameters now refer to dominant 
contribution only. 

We assume that the negligible computing-cost contri- 
bution implies that we can also neglect the corresponding 
mismatch: if the respective step is cheap, one can easily 
increase sensitivity by reducing the corresponding mis- 
match until it is negligible, i.e. we assume (fJ. S c) ~ £ m - 
This qualitative argument will be confirmed by the gen- 
eral solution in the next section. We can therefore write 



the objective function Eq. (42) as 



L {N,T,m) ps (1 -£m) N' 1 '^ T . 



(73) 



Using Eq. ( 72 ) we can obtain 



N(C 0l to, T) = (C'o/k)- 1 ^ to-™/( 2£ ) T s ' £ , (74) 
AT(C , to, T) = (C /k) 1/£ to™/( 2£ ) T-"/ £ , (75) 
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Symbol 
N 
AT 
T 

Q 
Q 

n 

m 

— 

K 

5 



Description 
Number of segments 
Segment duration 
Total observation time 
a quantity Q referring to the coherent step 
a quantity Q referring to the incoherent step 
number of template-bank dimensions 
maximal template-bank mismatch parameter 
average mismatch factor € [0, 1] 
Lo sensitivity scaling with TV 

Lagrange multiplier for computing-cost constraint L 
computing-cost prefactor 



Relations 



T = N AT 



L 0\ m ,T 

Lo + ro(C to t — Co) 



{ m 
oc N- 1 /^ 



computing-cost T- or AT- exponent at fixed TV 
computing-cost TV-exponent at fixed AT 
computing-cost TV-exponent at fixed T 



C\ N oc AT oc T 
C| AT oc TV'' 
C\ T oc N- £ 



Refs 
EqT 



III 



III 



III 



Sec. 
Sec. 
Sec. 
Sec. 
Sec. 
Eq. 

Eqs. iS\, 1151 



III 



III 

|44| 



Eq. (23 1 



Eq. (|40 1 



Eq. (671 







Eqs. |53l, (581 



Eqs. (531,(581 



Eqs. (531,(581 



Eq. (65 I 



TABLE I: Overview of symbols and notation used in the formulation of the optimization problem. 



which shows that increasing T at fixed Co results in more 
and shorter segments, while increasing Co at fixed T re- 
sults in fewer and longer segments (assuming e > 0). 
Substituting this into Eq. (73) yields the threshold sig- 
nal strength 



/i t " h 2 ex {C /K) 1, ^ we) (1 - f» TO™/( 4t0£ ) 



where we introduced the parameter 
a = 2we — S , 



rpa/ (2we) 



(76) 



(77) 



which will be of critical importance in determining the 
character of the optimal solution. 

The objective function L oc can be easily maxi- 
mized over mismatch to, resulting in 



c (o) 

?™opt 



1 



4:ive 
n 



(78) 



which is independent of both Co and T. This solution 
differs from Eq. ( 69 ) of the fully coherent case, even when 
the coherent cost dominates (where e = 5 — 1). 

We see in Eq. ( 76 ) that there is no extremum of /ith 



(at least in regions of approximately constant power-law 
exponents). Given that w > 1 and generally e > 0, we 
can distinguish two different regimes depending on the 
sign of critical scaling exponent a defined in Eq. (77): 



a > 0: sensitivity improves (i.e. increases) with in- 
creasing T (at fixed Co). Therefore sensitivity is 
only limited by the total amount of data T max avail- 
able. 

a < 0: sensitivity improves with decreasing T, so one 
should use less data (until the assumptions change) . 

In practice these extreme conclusions will be modified, 
as the power-law exponents will vary (slowly) as func- 
tions of N and T, and the assumption of a dominating 



computing-cost contribution might also no longer be sat- 
isfied. The marginal case a = marks a possible sensi- 
tivity maximum, namely if increasing T results in a < 
and decreasing T leads to a > 0. 

We can obtain a useful qualitative picture of the full 
optimization problem by considering the two extreme 
cases of dominating computing contribution C or C: 

• if C C: we always have a > (for all cases 
of interest rj — 1, S > 2 and w > 1). Therefore 
sensitivity improves with increasing T . As seen in 



Sec. IV C 3 this shifts computing cost to the inco- 
herent contribution. Eventually one either uses all 
the data T max or the coherent cost no longer dom- 
inates. 

• if C 3> C: the incoherent parameter a can have 
any sign. If a > one would increase T until all 
the data T max is used (or we reach a = 0). If a < 
one would decrease T until the incoherent cost no 
longer dominates. 

These limiting cases show that the type of optimal solu- 
tion will be determined solely by the incoherent critical 
exponent a = Iwe — 5, namely 



T (p) _ 

J opt — 



finite if a < , 
oo otherwise , 



(79) 



which we refer to as the bounded and the unbounded 
regime, respectively. 

C. General optimality conditions 

We now return to the full optimization problem of 
Eq. (67), namely 



L = L + w [C + C - C ] , 



(80) 
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where 

L Q = [l-i{m + fh)]TN- 1 ^ 2w \ (81) 
C = Km- Tl/2 N^T* , (82) 
C = 2m- Tl/2 N~"T~ S . (83) 
It will be useful introduce the computing-cost ratio 

x = C/C, (84) 
and express the respective contributions as 



C = ^° C = — 

1 + x~ x ' 1 + x 



(85) 



Using Eqs. (82 1, (83) to solve for T and N, respectively, 
we obtain 



N D = (gb/g)! [m-^Cl + x- 1 )]' 

(C /k) ? [™-S/2( 1 + ^)] 5 



r 



(C /K) f [a-«/2(l + x )]'« 



(86) 
(87) 



where D is the determinant of the matrix [S,rj;S,fj], 
which for all cases of practical interest seems to be posi- 
tive definite, namely 



D = 5rj - 5 r] > . 
The segment length AT = T/N can similarly be obtained 



as 



ATg= (Co/g [**-«/' (1 + *)]^ 
(Cb/zs)" [m-s/2(i + x -i)]" 



1. Stationarity with respect to mismatches {m, m} 

Requiring stationarity with respect to the mismatches, 
i.e. df^L = df^L = 0, yields 



voC = — 2£ TN' 1 ^ 2 ™) 
n 

W C = -2£ ^ TN -i/l*») 
n 

which results in the remarkable relation 



(90) 



''opt 



jfl 



(91) 



The ratio of optimal mismatch per dimension is simply 
given by the computing-cost ratio x. This result confirms 



an assumption made in Sec. IV B about the optimal solu- 
tion, namely that a negligible computing-cost contribu- 
tion also implies that one can neglect the corresponding 
mismatch. 



2. Stationarity with respect to number of segments N 

Requiring stationarity with respect to N (treated as 
continuous), i.e. d^L = yields 



L + 2ww eC + eC = 0, 



and substituting Eqs. (90) and (81), we obtain 



^opt 



,(0) 
'opt 



fc opt 

7(0) 



(92) 



(93) 



where we used the asymptotic optimal mismatches m^ t 
defined in Eq. (78) for the two limiting cases of dom 



inating coherent or incoherent computing-cost, respec- 
tively. Equation ( 93 1 can be interpreted as defining 
a two-dimensional ellipse in ^fra wit h semi-major axes 



m^pj. Combining this with Eq. (|9l| we obtain the op- 



timal mismatches 



n 



m opt 
n 



rn. 



opt 



'(0) 
%pt 

n 

7(6) 



-(o) 
n 

7(6) 

% P t 



(94) 



which reduces to the limiting cases of Eq. (78) when 



cither computing cost dominates, i.e. when x < 1 oi 
x>l. We can express the optimal mismatch prefactor 
in Eq. ((811 as 



[l-£(m + m)] . 



1 nx + n 



Aw ex + e 



(95) 



The optimal mismatches Eq. (94) only depend on the 



computing-cost ratio x. Substituting into Eq. rt87J) we 
therefore obtain a relation of the form To = T(Cq, x op t) 
for given observation time Tq, which can (numerically) be 
solved for x opt = x(Cp, Tp). Similarly, one could specify 



N and solve Eq. (86) for x opt — x(C ,N ). In either 
case the optimal mismatches are obtained from Eq. ( 94 ) 



and the optimal number and length of segments from 
Eqs. (86) and (87), fully closing the optimal solution at 



fixed T 



3. Monotomcity relations with T 

It is interesting to consider the behaviour of the opti- 
mal "fixed- T" solution of the previous section as a func- 



tion of T. We see in Eq. ( 94 ) that m op t is monotonically 



increasing with x, while m opt is decreasing, i.e. 

dxfh op t > 0, and d^fh opi < . 



(96) 



We generally assume D = Srj — Srj > and e > which 
implies that the right-hand side of Eq. (87) is mono- 



tonically decreasing with x, while the left-hand side is 
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monotonically increasing with T. Therefore x must be 
montonically decreasing with T, i.e. 



dxx < . 



(97) 



Therefore the optimal solution shifts computing cost 
from the coherent to the incoherent step with increasing 
T, which had already been used in Sec. |IVB| Combining 
this with Eq. ( 96 ) we find 



drTOopt < 0, and d T m opt > . 



(98) 



and using this with Eqs. (86) and (89), we can further 
deduce 



d T N opt > , and d T AT opt < , 



(99) 



namely increasing T results in more segments of shorter 
duration. 



4- Stationarity with respect to observation time T 

Requiring stationarity of L with respect to T, i.e. 
8tL = 0, yields the final condition 



Lq + m 



SC + 6C 



0. 



which combined with Eq. ( 92 ) results in 
aC + aC = 0, 



(100) 



(101) 



where the critical exponents a are defined in Eq. ( 77 1 



We generally expect a > 0, as discussed in Sec. IV B 



and therefore the stationarity condition can only have a 
solution if 

a = 5-2?j+2(w-l)e < 0. (102) 
This conclusion is consistent with the analysis of 



Sec. IV B a < characterizes a bounded regime with fi- 



nite optimal T^ t , while a > characterizes an unbounded 



opt ' 
(0) 



regime with T^ pt — > oo 



If Tq°^ exceeds the available data T„ 



<-max) then we sim- 
ply apply the fixed- T solution of Sec. |IV C 2| Otherwise 
Eq. 11011) directly yields 



''opt 



a 
a 



(103) 



closing the optimal solution via Eqs. (94), (86) and (87) 



5. Monotonicity relations with Co 

For a bounded optimal solution with T^ pi < T max , we 
see from Eq. (103) that x opt and {m opt , fh opt } are inde- 
pendent of the computing-cost constraint Cq. Inserting 



Eqs. (86 1,(87 1 into Eq. (42), we can therefore read off the 
scaling 



,1 n (a-a)/(iwD) 



(104) 



which shows that any "reasonable" search should satisfy 

a>a, (105) 

in order for sensitivity to improve with increasing Co (as- 
suming D > 0). Furthermore, from Eqs. (86), (87) and 
(89) wc obtain the monotonicity relations: 



dco^opt oc <5 - <5, 
dco^opt oc e - e , 
c?c AT op t oc rj- rj. 



(106) 



We expect rj > rj = 1, therefore the optimal segment 
length AT opt will generally increase with Cq. 

The behaviour of the optimal number of segments is 
less clear-cut: if S < S then iV op t decreases with Co, which 
can result in a fully coherent search being optimal, de- 
spite T^l < T max . A StackSlide search is therefore not 
guaranteed to be more sensitive than a fully coherent 
search at the same computing power, even when compu- 
tationally limited. 

Similarly, T opt can either increase with Co (if e > e) , 
or decrease: a more expensive and more sensitive search 
can be using less data. 



V. EXAMPLES OF PRACTICAL APPLICATION 

In order to illustrate the practical application of this 
analytical framework and its potential gains in sensitivity 
we consider a few different examples of CW searches. 

A. Directed searches for isolated neutron stars 

Directed searches target NSs with known sky-position 
but unknown frequency and frequency derivatives, i.e. 
{/i /)/)•■ •}■ The approximate phase metric of this pa- 
rameter space for isolated NSs is known analytically and 
constant over the parameter space, e.g. see [Eq. (10) in 
|15j]. The number of coarse-grid templates scales as 



M cx AT n(,l+1)/2 



(107) 



while the refinement of the semi-coherent metric [Eq. (92) 
in scales as 



7n cx AM«-D/ 2 



(108) 



The coherent computing-cost exponents Eq. (54) are 
therefore 



8 = n(n + l)/2 + AS, r) = l 



(109) 
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where AS depends on the ^-statistic implementation as 
given by Eq. ( 55 1 . The incoherent computing-cost expo- 
nents Eq. ( 59 ) are 



8 = n(n + l)/2, rj = 1 +n(n- l)/2, (110) 

which results in e — n — 1. 

For n > 2 the condition 5 = 8 — 2 + 2(w — l)e> holds 
as expected, while the critical boundedness parameter of 
Eq. ( 102 ) now reads as 



a = -(3 - n) - 2 + 2(w - l)(n - 1) , 

which for the first few values of n evaluates to 

a 1 = -1, 
a 2 = —3 + 2w , 
a,3 = — 6 + 4w , 
04 = — 10 + 6w. 



(Ill) 



(112) 



In the WSG limit (i.e. w — > 1) this is always a < 0, and 
therefore the search falls into the bounded regime. How- 
ever, in general w > 1 and therefore directed StackSlide 
searches can be cither bounded or unbounded. 



Directed search for Casstopeia-A 

As a concrete example we consider the directed search 
for the compact object in Cassiopeia- A (CasA). This 
search has been performed using LIGO S5 data, and the 
resulting upper limits have been published in [7 j . For the 
present example we use the search setup as originally pro- 
posed in |15j . namely a fully coherent ^-statistic search 
(using the "SFT" method, i.e. AS = 1) using data span- 
ning T = 12 days, with a maximal template-bank mis- 
match of to = 0.2. The setup assumed two detectors 
with identical noise floor S n and a 70% duty cycle, which 
we can formally incorporate as Afoot = 2 x 0.7 = 1.4 
in Eqs. (50) and (29). The parameter space spanned 
a frequency range of / 6 [100, 300] Hz and spindown- 



ranges corresponding to a spindown-age of r = 300 y, see 
|15j . The template-bank dimension for the given Stack- 
Slide parameters was determined as n = 3, r esulti ng in a 
power-law scaling of 8 — 7 according to Eq. (109). 



In order to compare sensitivity estimates of different 
search setups, we use nominal (per-template) false-alarm 
and false-dismissal probabilities of 



-10 



ph = o.i 



(113) 



We use a rough estimate of £ = 0.5 (e.g. see [Fig. 8 
in [33]]) for the geometric average- mismatch factor of 
the ^-lattice that was used in this search. Integrating 
Eqs. (25]), ([26} and solving for p* T yields p* r w 8.35. Sub- 
stituting this into Eq. (29) with fh = 0, to. = 0.2 yields 



an estimate for the weakest detectable signal h t h of the 
original ^-statistic search: 



Hh 



AS* 



5.2 x 10- 3 VUz 



(114) 



ref 



Timing a current StackSlide code using the SFT- 
method, one can extract approximate timing parameters 



^ FT = 7x 10" 8 S , 



c = 6 x 10" y s , 



(115) 



which results in a total computing cost for the original 
search 7 of Co ~ 472 days on a single cluster node. This 
number is used as the computing-cost constraint Cq for 
this example. 

First we consider an optimal coherent search as de- 



scribed in Sec. IV A namely using Eq. (69) we find 



pt 



0.18 



''opt 



0.36, 



(116) 



and using Eq. (70) this results in T opt = 13.6 days, which 
is only about ~ 13% longer then the original search pro- 
posal of |15j . The total improvement in the minimal sig- 
nal strength hth, is less than 2% compared to Eq. (114), 



which shows that the original search proposal was re- 
markably close to an optimal coherent search. 

Next we consider a StackSlide search over the same 
parameter space using the same computing cost Cq. As- 
suming the optimal solution will have segment lengths in 
the range 1 days < AT < 7 days, and a total span of 
T < 365 days, the parameter-space dimensions would be 
n — 2, n = 3 (see |15j). This results in power-law expo- 
nents (5 = 4, 8 — 6 , rj = 4, and therefore e = 3, e = 2, 
and a = 2 + 6(w — 1), a = — 2 + 4(w — 1). In order to 
simplify the example we use the WSG approximation, i.e. 
w = 1, which implies that the search would be bounded 
(a < 0). We can therefore use Eq. (103) to obtain the 
optimal computing-cost ratio as 



"Copt 



= 1 . 



(117) 



Note that when w > 1.25 we would have a > and 
therefore this search would become unbounded. From 



Eq. 



78 



;(0) 
'opt 



0.29, 



i (0) 

'opt 



0.55, and using 



Eq. (Ml we find the respective optimal mismatches as 



'opt 



0.16. 



''Opt 



0.24. 



(118) 



Using Eq. ( 63 ) we can extract the computing-cost coeffi- 
cients k w 3.14 x 10~ 17 and « w 3.12 x 10 -34 (with time 
measured in seconds), and plugging this into Eqs. (86), 



( 87 ) we find the optimal StackSlide parameters as 



Afo pt = 139, AT opt w 1.9 days, 
T op t w 266.5 days, 



(119) 



7 Using the original timing constant Cgp T = 6 X 10 7 s of |15| . we 
correctly recover the original estimate of C ~ 20 X 200 days 
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FIG. 3: Numerical optimal fixed- T solution for a directed CasA search as a function of T. The dashed vertical line indicates 
the analytical WSG-optimal solution of Eq. (1181, while the dotted vertical line corresponds to the exact optimal solution. 
Panel (a) shows the weakest detectable signal strength compared to the reference value /ith.ref of Eq. ( |114| ), for the exact /ith 
and for the WSG-approximated /iJh SG (using w = 1). (b) shows the optimal mismatch parameters m opt (T) and m opt (T), (c) 
shows the optimal computing-cost ratio *c op t(T) and (d) the optimal number of segments N opt (T) (treated as continuous). 



which is self-consistent with the initially-assumed 
template-bank dimensions, as it falls into the assumed 
ranges for AT and T. 

We can estimate the resulting sensitivity by solving 
Eqs. (25), (26), which yields « 17.3, and substituting 
into Eq. ( 29 ) we find a weakest detectable signal strength 
h th of 



Hh 



2.4 x 10" 



Hz. 



(120) 



opt 



which is an improvement on the optimal coherent sensi- 
tivity by more than a factor of two. 

Figure [3] illustrates the behaviour of the optimal so- 
lution as a function of T without using the WSG ap- 
proximation. This is obtained by numerically solving 
Eq. Q for x pt(T), which yields m opt (T) via Eq. @ 
and N opt (T) via Eq. (861). We see that the non-WSG 
approximated optimal solution results in somewhat dif- 
ferent StackSlide parameters than the WSG solution of 



Eq. (119), but it hardly gains any further sensitivity. 



Increasing the total computing cost Co would increase 
the relative advantage of the StackSlide method com- 
pared to a fully-coherent search: the coherent search 



would gain sensitivity as h 



th 



Q 



1/14 



according to 



Eq. (71 1, while the StackSlide search would gain sensitiv- 
ity as cx Cg^ 10 



according to Eq. ( 104 ) (in the WSG 



approximation) , so here the StackSlide search is more "ef- 
ficient" at converting increases of computing-power into 
gains of sensitivity. 



B. 



All-sky CW search using EinsteinOHome 



As an example for a wide parameter space all-sky 
search with massive computing power, we consider two 
recent CW searches performed on the Einstein@Homc 
computing platform EH 50], namely the StackSlide 
searches labelled 'S5GC1' and 'S6Bucket', which em- 
ployed an efficient grid mapping implementation de- 
scribed in [27] . 

An Einstein@Home search divides the total workload 
into many small workunits, each of which covers a small 
fraction of the parameter space and requires only a couple 
of hours to complete on a host machine. These searches 
consisted of roughly 10 7 workunits. The E@H platform 
delivers a computing power of order 100 Tflop/s, and 
these searches ran for about 6 months each, so we can 
estimate their total respective computing cost is of order 
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C tot ~ 10 21 flop (i.e. ~ 1 Zeta flop). Each E@H workunit 
is designed to require about the same computing cost, 
which allows us to base the present analysis on just a 
single workunit. 

The detector data used in these searches contained 
non-stationarities and gaps, and the template banks were 
constructed in somewhat semi-empirical ways that are 
hard to model analytically. In order to simplify this anal- 
ysis we assume gapless stationary Gaussian data, and we 
use the analytic metric expressions from |26) to estimate 
the number of templates. This example is therefore "in- 
spired by" recent E@H searches, but does not represent a 
detailed description of their computing cost or sensitivity. 

The two searches 'S5GC1' and 'S6Bucket' covered a 
fixed spindown-range corresponding to a spindown age of 
T = fo/f — 600 y at a reference frequency of f = 50 Hz. 
Each workunit covers a frequency-band of A/ = 0.05 Hz, 
the spindown range of A/ = 2.7 x 10 _9 Hz/s and a 
(frequency-dependent) fraction q of the sky. We can in- 
corporate the sky-fraction q by using template counts 
qAf in the computing-cost expressions, where Af are the 
all-sky expressions from [26] . For simplicity we fix the 
parameter-space dimension to n = 4, namely {sky, fre- 
quency, spindown}, and we use [Eq. (56), (50) in [2E]] 8 for 
the number of coarse-grid templates Af and the refine- 
ment factor j(N) of [Eq. (77) in [55]] (assuming gapless 
data). 

For a workunit at 50 Hz, the reference StackSlide pa- 
rameters are: 

• S5GC1: q = 1/3, N = 205, AT = 25h 

• S6Buckct: q = 1/51, N = 90, AT = 60 h 

For both searches the mismatch distributions of the 
coarse- and fine-grid template banks are not well quan- 
tified, so we simply assume hyper-cubic template banks 
(£ = 1/3) with fh — fh — 0.5, i.e. an average total mis- 
match of (/i) = 1/3. Plugging these parameters into 
the template-counting formulae of J5S], together with 
the timing constants of Eq. (1151 from the Cas-A ex- 



to improve sensitivity by increasing T until we hit the as- 
sumed upper bound of T max = 1 y, so we solve Eq. (|87|) 



ample, we find a reference per-workunit computing cost 
of C w 0.91 h for S5GC1, and C « 2.5 h for S6Bucket. 9 
Table |TT] shows the estimated sensitivity for these refer- 
ence searches assuming the same false-alarm and false- 
dismissal probabilities as in the previous section. 

We can apply the analytical optimal solution from 



Sec. IV with the extracted power-law coefficients at the 
reference StackSlide parameters found in Tabic [H] This 
initially places us into the unbounded regime (i.e. a > 0) 
for both 'S5GC1' and 'S6Bucket'. We therefore expect 



8 There are missing terms in both [Eqs. (57) and (83) in [26]] . but 
one can use their Eqs. (50) and (84) instead to compute det g. 

9 The actual E@H workunits take about 6 h to complete on a ma- 
chine with these timings, but these setups included bigger refine- 
ment factors 7 due to gaps in the data, and used rather different 
template-bank designs. 



for ^ op t(T max ), substitute into Eq. (94) for {m opt ,fh opt 



z opt max 

and obtain iV pt from Eq. ( 86 ) 



In order to find a self-consistent solution, we need to 
iterate this procedure: we extract new power-law coeffi- 
cients at the new solution, then re-solve until the param- 
eters converge to better than 1% accuracy. In the case of 
the 'S5GC1' search, the converged solution falls into the 
unbounded regime. In the case of 'S6Bucket' the con- 
verged solution falls into the bounded regime, but with 
The optimal observation time is therefore 



T (0) rp 

-'opt ^ J rr 



T op t = 1 y in both cases, and the resulting converged so- 
lutions and power-law coefficients are given in Table [H] 
We see that (under the present idealized conditions) we 
could gain ~ 23% in detectable signal strength h t h com- 
pared to the 'S5GC1' setup, and ~ 14% compared to the 
'S6Bucket' setup. 



C. All-sky search examples from CGK 

The all-sky search examples studied in CGK [3T] pro- 
vide another interesting test case for our optimization 
framework. CGK considered a multi-stage optimiza- 
tion, but we can treat their first-stage result as a single- 
stage optimization problem at fixed given computing 
cost. CGK discussed four different cases, namely a search 
for either "young" (Y) neutron stars (r = /// = 40 y) 
or "old" (O) neutron stars (r = 10 6 y), using either a 
"fresh-data" (f ) or "data-recycling" (r) mode (a distinc- 
tion that is irrelevant for our present purpose). The op- 
timized CGK StackSlide parameters and computing-cost 
constraints are found in [Tables I- VIII in [21]], and are 
summarized in Table |III| For the sensitivity estimates we 
use the same false-alarm and false-dismissal probabilities 
as in Sec. IV Al 

Note that we expect our results to improve on the sen- 
sitivity of the CGK solution, as they incorporated an 
ad-hoc constraint of m = fh = fh, and the total average 
mismatch in [Eq.(46) in CGK] incorrectly included only 
the contribution from one template grid instead of both, 
as discussed in Sec. IIIIB3I 

The functional form of the template-bank equations 
(originally from BC [17] ) in the CGK computing-cost 
model [Eq.(53) in CGK] is not consistent with the generic 
form of Eq. ( 43 ) with respect to the mismatch scaling. 



We therefore resort to extracting (potentially fractional) 
"mismatch dimensions" {n,n} using Eq. (64), in order to 
fully reproduce their computing-cost function with the 
power-law model of Eq. (|61|). The scaling parameters 



{S,r]} are extracted via Eq. (62) and w from Eq. (41). 



The resulting values are given in Table |III[ assuming the 
FFT/resampling method for the .F-statistic calculations. 

Using the extracted scaling coefficients to compute the 
optimal solution from Sec. |IV| results in a solution that 
is inconsistent with the initially extracted scaling coeffi- 
cients. An iteration over solutions, allowing both N and 
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8 


6 




w 


iV 


AT[d] 


m 


m 


C [h] 




^[vTii] 


S5GC1 


8.7 


7.7 


2.0 


1.1 


205 


1.0 


0.50 


0.50 


0.91 


2.545 


2.69 x 10" 3 


?max — 1 y 


10.0 


9.0 


2.0 


1.1 


528 


0.7 


0.14 


0.17 


0.91 


0.869 


2.19 x 10" 3 


S6Bucket 


4.6 


3.6 


2.0 


1.2 


90 


2.5 


0.50 


0.50 


2.54 


13.914 


2.20 x 10" 3 


T max = 1 y 


3.7 


2.7 


2.0 


1.2 


175 


2.1 


0.58 


0.32 


2.54 


1.815 


1.93 x 10" 3 



TABLE II: Einstein@Home example setups 'S5GC1' and 'S6Bucket', with corresponding results from an iterative optimization 
at fixed computing power Co, with assumed maximal observation time of T max = ly. The gains in weakest detectable signal 
strength h t ^ are ~ 23% and ~ 14%, respectively. 





n 


n 


8 


8 


V 


w 


N 


AT[d] 


m 


m 


Co[Zf] 








Y/r 


3.0 


4.0 


3.1 


6.0 


4.2 


1.7 


10 


2.6 


0.78 


0.78 


0.94 


0.071 


7.31 x 10" 


3 




= NcGK 


3.0 


4.0 


3.1 


6.0 


4.2 


1.7 


10 


2.1 


0.17 


0.48 


0.94 


0.482 


6.38 x 10" 


3 




Y/f 


3.0 


4.0 


3.1 


6.0 


4.2 


1.7 


9 


2.7 


0.78 


0.78 


0.82 


0.086 


7.42 x 10" 


3 




= Nqgk 


3.0 


4.0 


3.1 


6.0 


4.2 


1.7 


9 


2.2 


0.18 


0.46 


0.82 


0.526 


6.50 x 10- 


3 




O/r 


2.5 


2.5 


3.0 


10.0 


7.4 


1.8 


8 


14.8 


0.35 


0.35 


0.74 


0.004 


2.61 x 10 - 


3 


N 


= NcGK 


2.8 


2.5 


3.0 


10.0 


7.4 


1.8 


8 


14.6 


0.03 


0.34 


0.74 


0.090 


2.46 x 10" 


3 




O/f 


2.5 


2.6 


3.0 


10.0 


7.3 


1.7 


9 


11.8 


0.21 


0.21 


0.35 


0.009 


2.65 x 10" 


3 


N 


= NcGK 


2.7 


2.5 


3.0 


10.0 


7.3 


1.7 


9 


12.4 


0.04 


0.33 


0.35 


0.107 


2.56 x 10" 


3 



TABLE III: CGK example search setups for young (Y') and old pulsars ('O'), using either fresh (T) or recycling (V) data- 
modes. The first line of each example gives the original CGK solution with respective extracted power-law coefficients, and 
the second line shows our optimal self-consistent solution with constraint N = -/Vcgk- The computing cost Co is measured in 
Zeta-flop (lZf = 10 21 flop). 



T to vary, did not converge. We therefore solve a sim- 
pler problem by fixing the number of segments to the 
original CGK values, i.e. we constrain the solutions to 
N = Nqgk- We proceed by solving Eq. (86) for x op %{N), 
closing the solution via Eqs. (94) and (87). We then ex- 



tract new power-law coefficients at this solution and it- 
erate this procedure until convergence to better than 1 % 
accuracy is achieved. The resulting fixed- TV optimal solu- 
tions are given in Table [TTT] The respective improvements 
of the weakest detectable signal strength h th compared 
to the original CGK solutions are ~ 15 % in the young 
(Y) pulsar case, and ~ 5 % in the old (O) pulsar case. 



D. CWs from binary neutron stars 

For CWs from NSs in binary systems with known sky- 
position (such as Sco-Xl and other LMXBs), the search 
parameter space typically consists of the intrinsic signal 
frequency and orbital parameters of the binary system, 
i.e. (projected) semi-major axis, orbital period P, pe- 
riapse angle, eccentricity and eccentric anomaly. The 
corresponding template-counting formulae were initially 
studied in [JT] for coherent searches. These have re- 
cently been extended to semi-coherent searches by Mes- 
senger [32] , giving explicit template scalings in two limit- 
ing cases, namely (i) short coherent segments compared 
to the orbital period, i.e. AT <C P , and (ii) long coherent 
segments, i.e. AT >• P. 



(i) Short coherent segments (AT <C P) 

One can change parameter-space coordinates and 
Taylor-expand in small AT/P <C 1 to obtain the coher- 
ent template scaling [Eq. (24) in [32]]: 



Nn cx AT n(n+1)/2 . 



(121) 



where n is the effective coherent parameter-space dimen- 
sion using the new coordinates. The coherent cost power- 
law coefficients are therefore S = n(n + l)/2 + A5 and 
rj = 1. 

The semi-coherent template scaling including eccen- 
tricity results in a 6-dimensional semi-coherent template 
bank, i.e. n = 6, and a template scaling [Eq. (28) in [32]] 

of Afecc <x N AT 7 . In the case of small eccentricity one 
has n = 4, and the template scaling given in [Eq. (29) in 
[32] ] is A/" c i rc oc N AT 5 . In both cases the semi-coherent 
power-law exponents satisfy 5 > 5, and rj — 2, resulting 
in the critical parameter a > 0. This implies that the 



boundedness-condition Eq. ( 102 ) is always violated, i.e. 
one should use all the availat 



ale data. 



(ii) Long coherent segments (AT 3> P) 

In this limit the template scalings in both the coherent 
and semi-coherent step are [Eqs. (32,33) in [32]]: J\f cx 
Af oc AT 2 , which is unusual as there is no refinement. 
Therefore rj = rj = 1, and 5 = 2 + AS, while 6 = 2, 
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and therefore e — 1. We see that always a > and 
a = 2(w — 1) > 0, and therefore binary-CW searches 
in the long-segment limit also fall into the unbounded 
regime, i.e. one should use all the data. 



VI. DISCUSSION 

We have derived an improved estimate of the Stack- 
Slide sensitivity scaling, correctly accounting for the mis- 
matches from both coarse- and fine-grid template banks, 
which had been overlooked by previous studies. By 
locally fitting sensitivity and computing-cost functions 
to power laws we are able to derive fully analytical 
self-consistency relations for the optimal sensitivity at 
fixed computing cost. This solution separates two dif- 
fere nt reg imes depending on the critical parameter a of 
Eq. (102): a bounded regime with a finite optimal T^ t , 

and an unbounded regime where T^ t — > oo. 

Several practical examples are discussed in order to 
illustrate the application of this framework. The corre- 
sponding sensitivity gains in terms of the weakest de- 
tectable signal strength h t h are found to be ~ 100% 



compared to a fully coherent directed search for CasA, 
and about 5% — 20% compared to previous StackSlide 
searches such as Einstein@Home and the examples given 
in CGK [3T]. We show that CW searches for binary 
neutron stars seem to generally fall into the unbounded 
regime where all the available data should be used irre- 
spective of available computing power. 

This study only considered single-stage StackSlide 
searches on Gaussian stationary gapless data from detec- 
tors with identical noise-floors. Further work is required 
to extend this analysis to more realistic data conditions. 
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